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proved to be flat coordinates of the usual probabilities for the derived dually flat 
structure. Finally, we demonstrate that escort probabilities with the new structure 
admits a simple algorithm to compute Voronoi diagrams and centroids with respect to 
alpha-divergences. 
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1. Introduction 

Escort probability is naturally induced from researches of multifractals [1] and non- 
extensive statistical mechanics [2 to play an important but mysterious role. Testing 
its utility in the other scientific fields would greatly help our understanding about it. 
This motivates us to approach the escort probability by geometrically studying its role 
in information science. 

The first purpose of this paper is to investigate the escort probability from 
viewpoints of information geometry [31 H] and afline differential geometry [5]. The 
second is to show that escort probability with information geometric structure is useful 
to construction of Voronoi diagrams [6] on the space of probability distributions. 

Recently, it is reported [7J |8] that a-geometry, which is an information geometric 
structure of constant curvature, has a close relation with Tsallis statistics [2]. The 
remarkable feature of the a-geometry consists of the Fisher metric together with a one- 
parameter family of dual afline connections, called the a-connections. 

We prove that the manifold of escort probability distributions is dually flat by 
considering conformal transformations that flatten the a-geometry on the manifold of 
usual probability distributions. On the resultant manifold, escort probabilities consist 
of an affine coordinate system. See also [9] for another type of flattening a curved dual 
manifold by a conformal transformation. 

The result gives us a clear geometrical interpretation of the escort probability, and 
simultaneously, produces its new obscure links to conformality and projectivity. Due 
to these two geometrical concepts, however, the obtained dually flat structure inherits 
several properties of the a-geometry. 

The dually flatness proves crucial to construction of Voronoi diagrams for a- 
divergences, which we shall call a- Voronoi diagrams. The Voronoi diagrams on the space 
of probability distributions with the Kullback-Leibler [TOl ITT], or Bregman divergences 
[T2] have been recognized as important tools for various statistical modeling problems 
involving pattern classification, clustering, likelihood ratio test and so on. See also, e.g., 
[TBI ITTl [T5] for related problems. 

The largest advantage to take account of a-divergences is their invariance under 
transformations by sufficient statistics [16] (See also [I] in a different viewpoint), which 
is a significant requirement for those statistical applications. In computational aspect, 
the conformal flattening of the a-geometry enables us to invoke the standard algorithm 
[29] [6] using a potential function and an upper envelop of hyperplanes with the escort 
probabilities as coordinates. 

Section 2 is devoted to preliminaries for a-geometry in the light of affine differential 
geometry. In section 3, as a main result, we consider conformal transformations and 
discuss properties of the obtained dually flat structure. Dual pairs of potential functions 
and afline coordinate systems on the manifold are explicitly identified, and the associated 
canonical divergence is shown to be conformal to the a-divergence. Section 4 describes 
an application of such a flattened geometric structure to a- Voronoi diagrams on the 
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probability simplex. The properties and a construction algorithm are discussed. Further, 
a formula for a-centroid is touched upon. 

In the sequel, we fix the relations of two parameters q and a as q = (1 — a)/2, and 
restrict q > 0. 



2. Preliminaries 

We briefly introduce a-geometry via affine differential geometry. See for details [TJ [8] . 
Let S n denote the n-dimensional probability simplex, i.e., 



S n :={p=(Pi) 



n+l -\ 

Pi > 0, J2 Pi = 1 \ 
1=1 J 



(1) 



r£ )fc (p) = ^ [--Sfi+PHgij ), i,j,k = l,---,n, (4) 



and pi,i = 1, ■ • • ,n + 1 denote probabilities of n + 1 states. We introduce the a- 
geometric structure on S n . Let {<9j}, i = 1, ■ ■ • , n be natural basis tangent vector fields 
on S n defined by 

di := j- - - , i = l, •••,n, (2) 

where = 1 — X^iLiPi- Now we define a Riemannian metric g on 5™ called the Fisher 
metric: 

1 1 

9ij(p) ■= g(di, dj) = —dij + (3) 

Pi Pn+1 

n+l 

= ^Pkidilogp^idjlogpk), i,j = l,---,n. 
k=i 

Further, define an torsion-free affine connection V^- 1 called the a-connection, which is 
represented in its coefficients by 

a ( 1 

where 8^ is equal to one if i — j — k and zero otherwise. Then we have the a-covariant 
derivative V^, which gives 

n 

fe=i 

when it is applied to the vector fields di and dj. 

There are two specific features for the a-geometry on S n defined in such a way. First, 
the triple (S n ,g, V (a) ) is a statistical manifold [TF] (See appendix A for its definition), 
i.e., we can confirm that the following relation holds: 

Xg(Y,Z) = g(V i « ) Y,Z)+g(Yy k Q) Z), X,Y,ZeX(S n ), (5) 

where X(S n ) denotes the set of all tangent vector fields on S n . Two statistical manifolds 
(S n , g, V 1 -")) and (S n , g, V ( -~ a - ) ) are said mutually dual. 

The other is that (S n ,g, V (q) ) is a manifold of constant curvature k = (1 — « 2 )/4, 

i.e., 

PS a \X, Y)Z = K,{g(Y, Z)X - g{X, Z)Y}, 
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where is the curvature tensor with respect to V^. From this property the well- 
known nonadditive formula of the Tsallis entropy can be derived [7]. 

In [8] we have discussed the a-geometry on S n from a viewpoint of affine differential 
geometry [5]. Consider the immersion / of S n into R™ +1 by 

f:p=(p t )^x = (x i ) = (L^\p l )), i = l r ..,n + l, (6) 

where (x % ),i = 1, • • • ,n+ 1 is the canonical flat coordinate system of R n+1 and the 
function is defined by 

1 — a q 

Note that f(S n ) is a level hypersurface in the ambient space R + +1 represented by 
*(aj) = 2/(1 + a), where 

We choose a transversal vector £ on the level hypersurface by 

n+l a 

e:=Ee ! ^-, r = - = -KarV (8) 

Then we can confirm that the affine immersion (/, £) realizes the a-geometry on5" [8]. 
Hence, it would be possible to develop theory of the a-geometry and Tsallis statistics 
with ideas of affine differential geometry [18J. 

Further, the escort probability [l] naturally appears in this setup. The escort 
probability P = (Pi) associated with p = (p,) is the normalized version of (pi) q , and is 
defined by 

( n .\g r i n+l 

p ^p) ■= V* = < = i. ■ ■ • . " + 1. Up) ■= E Ap), *(P) 6 (9) 

Hence, the simplex S n in the ambient space R + +1 , i.e., 
S n ■= lx = (x i 



n+l 

Y, x i = 1, x l > 

1=1 



represents the set of escort distributions P. 

Note that the element x* = (x*) in the dual space of R n+1 defined by 

1 



meets 



x*(p) := L^\ Vi ) = (p,) 1 "*, i = 1, • • • ,n + 1, 

1 - q 



x *(p) = q^Mp))- 



Hence, it satisfies [8] 

n+l n+l 

-Ee(pK(p) = i, e<(p)^ = °. ( 10 ) 

i=l i=l 

for an arbitrary vector X = Yli=i X l d/dx l at x(p) tangent to f(S n ). Thus, — x*(p) 
can be interpreted as the conormal map [5]. 
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/-i : (p.) ^ (p.) = [ ] , i = l,. ..,„ + !. 



3. A conformally and projectively flat geometric structure and escort 
probabilities 

In this section we show a main result. For this purpose, we consider a conformal and 
projective transformation \19\ [20l EH E2] of the a-geometry to introduce a dually flat 
one. This flattening of the a-geometry conserves some of its properties. The escort 
probabilities (Pi) are found to represent one of mutually dual affine coordinate systems 
in the induced geometry. While the many functions or geometric quantities introduced 
in this section depend on the parameter a or q, we omit them for the brevity. 
Let us define a function A on S n by 

A(p) := Y q = TEtLMfay 

which depends on a. Then, from (JH]) £ n is regarded as the image of S n for another 
immersion / := Xf, i.e., 

f:S n 3(p l )^(P l )eS n , z = l,...,n+l, 

and (Pi, • • • , P n ) is interpreted as another coordinate system of S n . Note that the inverse 
mapping j~ x is well-defined by 

(P) 1/g 

It would be a natural way to introduce geometric structure on £ n (and hence on «S n ) 
via the affine immersion (/, £) by taking a suitable transversal vector £, similarly to the 
case of the a-geometry mentioned above. Since S n is a part of a hyperplane in R n+1 , 
the canonical affine connection of R n+1 induces a flat connection, denoted by D^ E \ on 
S n . However, for the same reason, we cannot define a Riemannian metric in this wa}j§| 
because it vanishes on S n , regardless of any choice of the transversal vector £. 

The idea we adopt here is to define a Riemannian metric by utilizing a property 
of (S n , g, V^) called —1-conformal flatness. Based on the results proved by Kurose 
[13120], we conclude that the manifold (S n , g,V {a) ) is ±l-conformally flat (See Appendix 
A for its definition) because it is a statistical manifold of constant curvature. 

Actually, let V* be the flat connection^] on S n defined with and the differential 
/* by 

f.(y* x Y) = DfJ.Y, X,Ye X{S n ). 
Then, we can prove that and V* are projectively equivalent i.e., it holds that 

V* X Y = V ( x ] Y + d{ln\){Y)X + d{\n\){X)Y, X,Y e X(S n ). (11) 
Hence, if we define another Riemannian metric h on S n by 

h(X,Y):=Xg{X,Y), X, Y e X(S n ), (12) 

§ In affine differential geometry, a Riemannian metric is realized as the affine fundamental form of an 
affine immersion [5]. 

|| For the sake of notational consistency with the existing literature, e.g., OH], we first define V*, and 
later V as the dual of V*. 
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then, (S n ,g, V^) is — 1-conformally equivalent to (S n ,h, V*) equipped with a flat 
connection V*. Further, the manifold (S n ,h, V*) can be proved to be a statistical 
manifold (See Appendix B). 

Using the conormal map — x*(p), we can define the a-divergence as a contrast 
function (See Appendix A) inducing (g, V^, V* - "* 1 ) as follows [20]: 

n+l 

£>M(p,r)= -EzftrXs'ClO-sV)) 

i=l 

= (-**(»*), » (p) - 3;(r-)) = - - (aj*(r), aj(p)). 

Kj 

The statistical manifolds (5™, g, V^"-*) and (5™, #,V (q) ) are dual in the sense of 
(E]). Further, it is known [4 J that there exists the unique affine flat connection V on 
S n , dual with respect to (h, V*). Then, according to |20j, it is proved that (S n , h, V) is 
1-conformally equivalent to (S n , g, V^ - "**) and a contrast function p inducing (h, V, V*) 
is given by scaling _D(~ Q ) (See Appendix A) as follows: 

p(p,r) = X(r)D^ a \p,r) = -L-D^(p,r) 

1 (-x(r),x*(p)-x*(r)) = (-P(r),x*(p)-x*(r)). (13) 



Z q (r 



We shall call p a conformal divergence. 

Now, since (S n , h, V, V*) is a dually flat space, the standard result in [3], H] suggests 
that there exist mutually dual affine coordinate systems (9 1 , ■ ■ ■ , 6 n ) and (r/i, • • ■ , 1]„), a 
potential function ip(0) and its conjugate ip*{rf) satisfying 

• <"> 

They completely determine dually flat structure, i.e., the coefficients of h, V and V* 
are derived as the second and third derivatives of ip or x/j*, for example, 

,fdd\ d 2 *p uii / 9 d\ d 2 r 
Kj = h[ — , — I = t^t^t , h lJ =h\—, 



pd v dQi) 90 W {drii'drjjj 9^9%' 

9 9 \ / 9 9 \ 9 s -?/' 



r«* = M Vjl— , — = o, r* = & v 



and so on. In order to identify if),ip*,8 l and r/j explicitly without integrating or 
h % \ we shall search for them by examining whether the conformal divergence p can be 
represented in the form of the canonical divergence jl], i.e., 

n 

p(p,r) = i>(0(p))+r(v(r))-Y, di (P)Vi(r)- (15) 

i=l 

with the constraints (|14p . If this is possible, we can directly prove from ()A.4j) and flA.5[) 
that the obtained ip,ip* , (9 1 , ■ ■ ■ , 6 n ) and (771 , • • • , T] n ) are pairs of dual potential functions 
and affine coordinate systems associated with (S n , h, V, V*). 

Before showing the result, we define, for < q with q ^ 1, two functions by 

ln,(s) := s > 0, exp q (t) := [1 + (1 - q)^ 1 ^, t G R, 
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where [t] + := max{0,£}, and the so-called Tsallis entropy [23] by 

S q ( P ) ■= — q ■ 

Note that s = exp g (ln g (s)) holds and they respectively recover the usual logarithmic, 
exponential function and the Boltzmann-Gibbs-Shannon entropy — ^27=1 Pi m P« when 
q — > 1. For q > 0, ln 9 (s) is concave on s > 0. 

Theorem 1 For the dually flat space (S n , h, V, V*) defined via ±1- conformal 
transformation from (S n , g, V^, V^ _Q ^) ; the associated potential functions ip,ip* , and 
dually flat affine coordinate systems (9 1 ,---,9 n ) and (r]i, • ■ ■ ,r) n ) are represented as 
follows: 

0*(P) = <(p) z = l,---,n 

7fe(p) =-Pi(p), i = l,---,n 
V»(0(p)) = -K(Pn+i), 



1 



^(»»(p)) = -(A(p)-g) = r — 



a/<z 



where k = (1 — a 2 )/4 = g(l — g) zs ine scalar curvature of (S n , g, V^, V'- - "-') and 
r] n+1 := P n+1 (p) = 1 — Z)iLi-Pi(p)- Further, the coordinate systems (6* 1 , •••,#"■) and 
(771, • • • , r] n ) are V- and V* -affine, respectively. 

Proof) As is mentioned above we have only to check that the potential functions 
and dual affine coordinates 8\ r)i in the statement satisfy (JUT) and (115j) for the conformal 
divergence p. First, substitute them directly to the right-hand side of ffT5l) and modify 
it caring for the relation rj n+ \ = 1 — J27=i then we see that it coincides with p(p, r) in 
(|T3|) . Next, since it holds that ln ? (p 4 ) = x*(j>) — 1/(1 — q), we can alternatively represent 

6 l {p) = Irigipi) - ln 9 (p n+1 ) = kigipi) + ip(0(p)), i = 1, • • • , n. 

Hence, for 6* n+1 = it holds 

n+l n+1 

i = E^ = E ex p 9 (^-^)- 

8=1 8=1 



Differentiating the both sides by , j — 1, • • • , n, we have 



o=g(^-§)w' = fer-§gw 9 , 



Thus, the left equation of (I14p holds. Finally, note that the conformal factor is 
represented by 

1 _ g = q_ 

Z q {p) E^P*) 9 (exp q (S q (p) ))'-"' 
Using the formula [24J: 



A(P) = — = = TZZ-TT^ZvvTT^- (16) 



exp g (5 q (p)) = expi [Si(P 
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we see that 

A(p) = q (ex P| (SUP)^ 1 = q (eW) ■ 

Hence, the second equality in the expression of ip* holds. The right equation of (1141) 
follows if you again recall r] n +i = 1 — J2i=iVi- Q.E.D. 

Corollary 1 The escort probabilities Pi, i — 1, • • • , n are canonical affine coordinates of 
the flat affine connection V* on S n . 

Remark 1: Since the conformal factor A in (Tl6l) can be alternatively represented by 

A(P) = (exp^Ib))) 1 - = Klnq (exp fl (k(p))) + Q ' 
we have another expression of if)*, i.e, 

Thus, the potentials and dual coordinates given in the proposition recover the standard 
ones [31 H] when q — > 1, i.e, 

n+l 

^->-lnp n+ i, V* -^^PilogPi l -»■ log(pi/p n+ i), rii->Pii i = l,---,n. 

i=i 

Note that — coincides with the entropy studied in [251 12S1 l2Zj and referred to as the 
normalized Tsallis entropy. The conformal (or scaling) factor A often appears in the 
study of the g-analysis. 

Remark 2: Similarly to the above conformal transformation of (S n , g, V^), we can 
define another one for (S n , g, V*-~ a )) with a conformal factor 

X ' {P) := Egf^R' 

and construct another dually flat structure (/i 7 = A'g, V, V'*). Hence, the following 
relations among them hold (See Figured]). 

(S n ,h',V) ^ (S n ,h',V*) 
1-conformally equivalent £ £ — 1-conformally equivalent 

{S n ,g,V^) ^4 {S n ,g,V^) 
— 1-conformally equivalent £ £ 1-conformally equivalent 

{S n ,h,V*) £4 (5 n ,/i,V) 



Figure 1. Relations among geometries 



Remark 3: Because of the projective equivalence pip , a submanifold in S n is V^- 
autoparallel if and only if it is V*-autoparallel. In particular, the set of distributions 
constrained with the normalized g-expectations (escort averages) [2] is a simultaneously 
V^- and V*-autoparallel submanifold in S n . 
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4. Applications to construction of alpha- Voronoi diagrams and 
alpha-centroids 

For given m points p l3 - • • ,p m on S n we define a- Voronoi regions on S n using the a- 
divergence as follows: 

VorWfo) := f|{ pG 5 n |DW(p,p fc ) < DW(p lft )}, fc = l,---,ro. 

An a-Voronoi diagram on 5™ is a collection of the a- Voronoi regions and their 
boundaries. Note that approaches the Kullback-Leibler divergence if a — > — 1, 
and is called the Hellinger distance. If we use the Renyi divergence of order 1 
[28] defined by 

i n+l 

D a (p,r):= -ln^ferN 1 -, 

instead of the a-divergence, Vor*- 1-2 "^^) gives the corresponding Voronoi region 
because of their one-to-one functional relationship. 

The standard algorithm using projection of a polyhedron [2"9"j E] commonly works 
well to construct Voronoi diagrams for the Euclidean distance [6], the Kullback-Leibler 
[TT] and Bregman divergences [12], respectively. The algorithm is applicable if a distance 
function is represented by the remainder of the first order Taylor expansion of a convex 
potential function in a suitable coordinate system. Geometrically speaking, this is 
satisfied if i) the divergence is a canonical one for a certain dually flat structure and 
ii) its affine coordinate system is chosen to realize the corresponding Voronoi diagrams. 
In this coordinate system with one extra complementary coordinate the polyhedron is 
expressed as the upper envelop of m hyperplanes tangent to the potential function. 

A problem for the case of the a- Voronoi diagram is that the a-divergence on 
S n cannot be represented as a remainder of any convex potentials. The following 
theorem, however, claims that the problem is resolved by conformally transforming the 
a-geometry to the dually flat structure (h, V, V*) and using the conformal divergence 
p and escort probabilities as a coordinate system. 

Here, we denote the point on £ n by P = (P 1; • ■ • , P n ) because P n+ \ = 1 — J2i=i Pi- 
Theorem 2 i) The bisector of p k and p l defined by {p\D^ a \p,p k ) = D^{p,p t )} is 

a simultaneously V^- and V* -autoparallel hypersurface on S n . 

ii) Let T-Lk, k — 1, ■ • ■ ,m be the hyperplane in £ n x R which is respectively tangent at 
(Pk,ip*{Pk)) to the hypersurface {(P,y)\y = ip*(P)}, where P k = P{Pk)- The 
a-Voronoi diagram can be constructed on £ n as the projection of the upper envelope 
of Hk 's along the y-axis. 

Proof) i) Consider the V* -0 -* -geodesic 7 (_a ) connecting p k and p h and let p be 
the midpoint on ^~ a "> satisfying D^(p,p k ) = D^ip.p^). Denote by B the V^- 
autoparallel hypersurface that is orthogonal to and contains p. Then, for all 
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0.2 0.4 0.6 0.8 

Usual probability pi 




0.2 0.4 0.6 0.8 

Escort probability PI 



. 8 



0.6 



. 4 



0.2 



Figure 2. An example of a-Voronoi diagram on S 2 (left) for a = 0.6 (or q = 0.2) and 
the corresponding one on £ 2 (right). 




0.2 0.4 0.6 0. 

Usual probability pi 




0.2 0.4 0.6 0. 

Escort probability PI 



Figure 3. An example of a-Voronoi diagram on S 2 (left) for a = —2 (or q = 1.5) and 
the corresponding one on £ 2 (right). 



r G B, the modified Pythagorean theorem [201 U\ implies the following equality: 
D^(r,p k ) = Z>to(r,p) + DM(p,p h ) - KD^(r,p)D^(pp k ) 

= D^{r,p) + D^(p, Pl ) - K D^(r,p)D^(p, Pl ) = D^(r, Pl ). 

Hence, B is a bisector of p k and p v The projective equivalence ensures that B is also 
V*-autoparallel. 

ii) Recall the equality D^ a \p,r) = D^~ a '(r,p) and the conformal relation ffT3l) 
between D^~°^ and p, then we see that Vor (Q) (p fc ) = Vor (conf) (p fc ) holds on <S™, where 

Vor^\p k ) :=f]{pe S n \p(Pk,P) < p(Pi,p)}- 
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Theorem [TJ relations (THj) and ( 1T5|) imply that p(p k ,p) is represented with the 
coordinates (Pj) by 

p( Pk ,p) = r(p) - (r(p k ) + e ^(p k )(Pi(p) - p*(p*))) , 

where P = P{p). Note that a point (P, yk(P)) in is expressed by 

y*(P) := r(Pk) + E -Z;(P*)(Pi(p) ~ PiiPk))- 

Hence, we have p(p k ,p) = ip*(P) — y k (P). We see, for example, that the bisector on 
£ n for p k and p l is represented as a projection of "H^ n"%. Thus, the statement follows. 
Q.E.D. 

The figure 2 and 3 show examples of a-Voronoi diagrams on the simplex of 
dimension 2. In these cases, the bisectors are simultaneously V^- and V*-geodesics. 

Remark 4'- I n [3Q] Voronoi diagrams for broader class of divergences (contrast 
functions) that are not necessarily associated with any convex potentials are studied from 
more general affine differential geometric points of views. The construction algorithm is 
also given there, which is applicable if the corresponding affine immersion is explicitly 
obtained. 

On the other hand, the a-divergence defined not only on S n but on the positive 
orthant R™ +1 can be represented as a remainder of the potential \I/ in (JTJ) pl HJ [8]. 
Hence, the a-geometry on R™ +1 is dually flat. Using this property, a- Voronoi diagrams 
on R™ +1 is discussed in [3Tj . 

While both of the above methods require computation of the polyhedrons in the 
space of dimension n + 2, the new one proposed in this paper does in the space of 
dimension n + 1. Since the optimal computational time of polyhedrons depends on the 
dimension d by 0(mlogm + m\ d ^ 2 ^) [32], the new one where d = n + 1 is slightly better 
when n is even. 

The next proposition is a simple and relevant application of escort probabilities. 
Define the a-centroid c^ a > for given m points p 1 , ■ ■ ■ ,p m on S n by the minimizer of the 
following problem: 

m 

minE^ (a) (P fc ,P)- 

pes k=i 

Proposition 1 The a-centroid for given m points p l5 ■ • • , p m on S n is represented 
in escort probabilities by the weighted average of conformal factors X(p k ) = 1/Z q (p k ), 
i.e., 

i m 

Pi(c^) = E Z g (p k )Pi(p k ), i = 1, • • - ,n + 1. 

2^=1 ^qVPk) k=l 

Proof) Let 9 i = 0*(p). Using (JE), (jE) and the relation D^ a \p,r) = D ( ~ Q) (r,p), we 
have 

jry fyi fyi y~i 

E^ (a) (p fc >p) = £^(p*Mp.p*) = Y, Z MW) + r(ri(p k )) -E^(p*)}- 

fc=l fc=l fc=l i=l 
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Then the optimality condition is 

am m 

— £ D^(p k , P ) = £ z q ( Pk )( Vi - =o, i = i, ■ ■ 

90 fc=i fc=i 
where r/j = r/j(p). Thus, the statement follows from Theorem [1] for z = l,---,n. For 
z = n + 1 it follows from the fact that the sum of the weights is equal to one. Q.E.D. 

5. Concluding remarks 

We have considered ±l-conformal transformations of the a-geometry and obtained 
dually flat structure (S n , h, V, V*). Further the potential functions and dually flat 
coordinate systems associated with the structure have been derived. We see that the 
escort probability naturally appears to play an important role. 

From a viewpoint of contrast functions, the geometric structure compatible to the 
Kullback-Leibler divergence is (S n , g, V^, V*- -1 ^), where g is the Fisher information 
and V (±1) are respectively the e-connection and the m- connection. Similarly, the a- 
divergence (or the Tsallis relative entropy), and the conformal divergence p in this note 
correspond to (S n , g, V^, V*- - "-') and (S n , h, V, V*), respectively. They are summarized 
in Figure HI 

KL divergence a-divergence conformal divergence 

(S n ,g,V^\V^) <— > (S n ,gM a) M~ a) ) <— > (S n , K V, V*), (<S n , h', V, V'*) 
dually flat constant curvature k dually flat 

Figure 4. transformations of dualistic structures 

The physical meaning or essence underlying these transformations would be 
interesting and significant, but is left unclear. (See recent publications (331 El] f° r 
such research directions.) 

Finally, we have shown a direct application of the conformal flattening to 
computation of a-Voronoi diagrams and a-centroids. Escort probabilities are found 
to work as a suitable coordinate system for the purpose. 
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Appendix A: Statistical manifold and «-conformally equivalence 

For details of this appendix see [T71 [191 EH [ZD [22] • For a torsion- free affine connection 
V and a pseudo Riemannian metric gona manifold Ai, the triple (M., g,V) is called a 
statistical manifold if it admits another torsion-free connection V* satisfying 

Xg{Y, Z) = g(V x Y, Z) + g(Y, V* X Z) (A.l) 
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for arbitrary X, Y and Z in X(At), where X(Ai) is the set of all tangent vector fields on 
At. It is known that (Ai, g, V) is a statistical manifold if and only if Vg is symmetric, 
i.e., (Vxg){Y, Z) is symmetric with respect to X,Y and Z. We call V and V* duals 
of each other with respect to g, and (Ai,g, V*) is said the dual statistical manifold of 
(Ai,g, V). The triple of a Riemannian metric and a pair of dual connections (g, V, V*) 
satisfying flA.lj) is called a dualistic structure on Ai. 

For a G R, statistical manifolds (Ai,g,V) and (Ai,g', V) are said to be a- 
conformally equivalent if there exists a positive function on At such that 

</(X,Y) =<f>g{X,Y), 

1 + a 

g(V x Y,Z) = g(V x Y,Z) - — - — d(\n<f>)(Z)g(X, Y) 

+ l -^{d{\n<t>){X)g{Y,Z) + d{\^){Y)g{X ) Z)}. 

Statistical manifolds (At,g, V) and (Ai, g', V) are a-conformally equivalent if and only 
if (Ai,g,V*) and (Ai,g, V'*) are — a-conformally equivalent. 

A statistical manifold (.M,g,V) is called a-conformally flat if it is locally a- 
conformally equivalent to a flat statistical manifold. Note that — 1-conformal equivalence 
implies projective equivalence. A statistical manifold of dimension greater than three 
has constant curvature if and only if it is ±l-conformally flat. 

We call a function p on M. x Ai a contrast function |35j inducing (g, V, V*) if it 
satisfies 

p(p,p) =0, P eM, (A.2) 
p[X\] =p[\Y] = 0, (A.3) 
g(X,Y) =-p[X\Y], (A.4) 
g(V x Y, Z) = - p[XY\Z], g(Y, V* X Z) = -p[Y\XZ], (A.5) 

where 

p[Xt ■■■X k \Y 1 ...Y l ](p):= (X 1 ) p ■ ■ ■ (X.UY^ ■ ■ ■ (Y^p, q)\ p=q 

for arbitrary p, q G M and Xi,Yj G X{M). If (M,g,V) and (M,g',V) are 1- 
conformally equivalent, a contrast function p' inducing (</, V, V'*) is represented by 
p inducing (g, V, V*), as 

p'(p,g) = <f>(q)p(p,o)- 

Appendix B: The proof for the fact that (S n ,h, V*) is a statistical manifold 

We show that V*/i is symmetric. By the definition of — 1-conformally flatness we have 

(V* x h)(Y, Z) = Xh{Y, Z) - h(V* x Y, Z) - h(Y, V* X Y) 
= d\(X)g(Y,Z) + XXg(Y,Z) 

- X{g(V^Y, Z) + d(ln X)(Y)g(X, Z) + d(\n X)(X)g(Y, Z)} 

- X{g(Y, V^Z) + d(\nX)(Z)g(X,Y) + d(\nX)(X)g(Z,Y)}. 
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Substitute the equality Xd(\n A) = d\ into the right-hand side, then it is transformed to 

\{Xg(Y, Z) - g(V^Y, Z) - g(Y, v£>Z) 

- d(\nX)(X)g(Y,Z) - d(\nX)(Y)g(X,Z) - d(\nX)(Z)g(X,Y)} 
= Xiy^g){Y,Z) - X{d(\nX)(X)g(Y,Z) + d(\nX)(Y)g(X,Z) + d(\nX)(Z)g(X,Y)}. 

Thus, V*h is symmetric because (S n , g,V^) is a statistical manifold, i.e., V^g is 
symmetric. Since is torsion-free, so is V* by the definition of — 1-conformally 

flatness. 
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